Detecting Cross-Cultural Differences Using a Multilingual Topic Model
نویسندگان
چکیده
منابع مشابه
Detecting Cross-cultural Differences Using a Multilingual Topic Model
Understanding cross-cultural differences has important implications for world affairs and many aspects of the life of society. Yet, the majority of text-mining methods to date focus on the analysis of monolingual texts. In contrast, we present a statistical model that simultaneously learns a set of common topics from multilingual, non-parallel data and automatically discovers the differences in...
متن کاملIntegrate Multilingual Web Search Results using Cross-Lingual Topic Models
With the thriving of the Internet, web users today have access to resources around the world in more than 200 different languages. How to effectively manage multilingual web search results has emerged as an essential problem. In this paper, we introduce the ongoing work of leveraging a CrossLingual Topic Model (CLTM) to integrate the multilingual search results. The CLTM detects the underlying ...
متن کاملMultilingual and cross-lingual news topic tracking
We are presenting a working system for automated news analysis that ingests an average total of 7600 news articles per day in five languages. For each language, the system detects the major news stories of the day using a group-average unsupervised agglomerative clustering process. It also tracks, for each cluster, related groups of articles published over the previous seven days, using a cosin...
متن کاملVisualizing Cross-Lingual/Cross-Cultural Differences in Concerns in Multilingual Blogs
The goal of this paper is to cross-lingually analyze multilingual blogs collected with a topic keyword. The framework of collecting multilingual blogs with a topic keyword is designed as the blog feed retrieval procedure. Multilingual queries for retrieving blog feeds are created from Wikipedia entries. Finally, we present an interface for visualizing crosslingual/cross-cultural differences in ...
متن کاملMultilingual Topic Detection Using a Parallel Corpus
We have developed an approach for topic detection from multilingual news, in particular Chinese and English. We extract named entities such as people names, geographical location names, and organization names automatically from the news content by transformation-based linguistic taggers. These sets of named entities together with the remaining content terms form the basis of news representation...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Association for Computational Linguistics
سال: 2016
ISSN: 2307-387X
DOI: 10.1162/tacl_a_00082